Two-level Throughput and Latency IO Control for Parallel File Systems

نویسندگان

  • Yiqi Xu
  • Ming Zhao
چکیده

Existing parallel file systems are unable to provide both throughput and response time guarantees for concurrent parallel applications. This limitation prevents different, competing applications from getting their desired performance as high-performance computing (HPC) systems continue to scale up and be used in a shared environment. This paper presents a new two-level scheduler for parallel storage systems, a new solution to address this challenge based on a distributed performance virtualization layer for parallel file systems (vPFS). It provides both bandwidth proportional sharing and response time guarantees by addressing them at different levels of the scheduler in a cooperative manner. The utility and performance of this scheduler are studied on PVFS2, a widely used parallel file system. An experimental evaluation using a typical HPC benchmark (IOR) shows that when the storage is not overloaded, requests complete within 95th percentile response time bound during 90% of the time. The scheduler can further favor more latencysensitive application under overloaded case.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MPI-IO In-Memory Storage with the Kove XPD

Many scientific applications are limited by the performance offered by parallel file systems. SSD based burst buffers provide significant better performance than HDD backed storage but at the expense of capacity. Clearly, achieving wire-speed of the interconnect and predictable low latency I/O is the holy grail of storage. Throughput and latency of in-memory storage promises to provide optimal ...

متن کامل

Falcon: Scaling IO Performance in Multi-SSD Volumes

With the high throughput offered by solid-state drives (SSDs), multi-SSD volumes have become an attractive storage solution for big data applications. Unfortunately, the IO stack in current operating systems imposes a number of volume-level limitations, such as pervolume based IO processing in the block layer, single flush thread per volume for buffer cache management, locks for parallel IOs on...

متن کامل

U.S. Department of Energy Best Practices Workshop on File Systems & Archives: Usability at Los Alamos National Lab

There yet exist no truly parallel file systems. Those that make the claim fall short when it comes to providing adequate concurrent write performance at large scale. This limitation causes large usability headaches in HPC computing. Users need two major capabilities missing from current parallel file systems. One, they need low latency interactivity. Two, they need high bandwidth for large para...

متن کامل

sRoute: Treating the Storage Stack Like a Network

In a data center, an IO from an application to distributed storage traverses not only the network, but also several software stages with diverse functionality. This set of ordered stages is known as the storage or IO stack. Stages include caches, hypervisors, IO schedulers, file systems, and device drivers. Indeed, in a typical data center, the number of these stages is often larger than the nu...

متن کامل

A scalable MPI implementation of an ad-hoc parallel I/O system

This paper presents the novel design, implementation and evaluation of an ad-hoc parallel I/O system (AHPIOS). AHPIOS is the first scalable parallel I/O system completely implemented in Message Passing Interface (MPI). The MPI implementation brings the advantages of portability, scalability and high-performance. AHPIOS allows MPI applications to dynamically manage and scale distributed partitio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013